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SITE-SPECIFIC HOMOGENOUS MODIFICATION OF POLYPEPTIDES 
TO FACILITATE COVALENT LINKAGES TO A HYDROPHILIC MOIETY 



5 

The present invention relates generally to polypeptides 
modified by the attachment thereto of compounds having amine 
10 reactive groups , methods for producing such modified polypeptides 
and compositions containing the modified polypeptides. More par- 
ticularly, the invention relates to homogeneous modified polypep- 
tides which are modified by attachment of hydrophilic moieties, 
including polymers , to selected positions in the polypeptide. 

15 

BACKGROUND 

The desirability of modifying biologically active and 
therapeutically useful polypeptides with a variety of compounds 
having amine reactive groups, such as hydrophilic polymers, e.g., 

,20 polyethylene glycol (PEG) , to enhance their pharmacokinetic 
properties has been noted. See, e.g., the discussion of the art 
in this area of polypeptide modification in published PCT patent 
application W087/00056. Such modification has been attempted to 
reduce adverse immune response to the polypeptide, increase the 

25 solubility for use in pharmaceutical preparations, and/or 
maintain a desirable circulatory level of such polypeptide for 
therapeutic efficacy. 

One significant problem not addressed by the extensive art 
in this area of polypeptide modification involves the extent to 

30 which a polypeptide can be modified by attachment of compounds 
having amine reactive groups. For example, treatment of a poly- 
peptide with PEG or similar polymers, can result in random 
attachment of the polymer at the amino terminus of the polypep- 
tide and/or at one or more lysine residues in the amino acid 

35 sequence of the protein. While several PEG groups can attach to 
the polypeptide, the end result is a composition containing or 
potentially containing a variety of species of "PEG-ylated" 
polypeptide. Such heterogeneiety in composition is undesirable 
for pharmaceutical use. 



WO 89/05824 



PCT/US88/04633 



10 



15 



The attachment of compounds with amine reactive groups to a 
polypeptide may alter the biological activity of the polypeptide. 
This effect is believed mediated by the position and number of 
the attachment site(s) along the polypeptide sequence. There 
thus remains in the art a need for a method enabling site 
specific attachment of such compounds to polypeptides, in a 
manner that enables the manipulation of the number and position 
of attachment sites. Such site specific attachments can generate 
homogeneously modified polypeptides which are therapeutically 
efficacious and which retain certain desirable characteristics of 
the natural polypeptides. 



Summary of the TnvpnHnn 

This invention provides materials and methods for site 
specific covalent modification of polypeptides permitting the 
production of compositions comprising homogeneously modified 
polypeptides or proteins and pharmaceutical Compositions 
containing same. "Homogeneously modified" as the- term is used 
herein means substantially consistently modified only at specific 
2D lysine residues. A homogeneously modified G-CSF, for example, 
includes a G-CSF composition which is substantially consistently 
modified at position 40, but not at positions 16, 23 and 34. 

To solve the problem of non-specific susceptibility of 
polypeptides to covalent modification by amine-reactive moieties, 
25 this invention first provides lysine-depleted variants («LDVs») 
of polypeptides of interest. LDVs of this invention encompass 
polypeptides and proteins which contain fewer reactive lysine 
residues than the corresponding naturally occurring or previously 
known polypeptides or proteins. The lysine residues in the 
30 peptide structure of the LDVs may occur at one or more amino acid 
positions occupied by lysine residues in the natural or 
previously known counterpart, or may be located at positions 
occuppied by different amino acids in the parental counterpart. 
Furthermore, LDVs may in certain cases contain more lysine 
35 residues than the parental counterpart, so long as the number pf 
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lysine residues in the LDV permits homogeneous modification by 
reaction of the LDVs with amine-reactive moieties, as discussed 
below* Since such, polypeptides or proteins of this invention 
contain a small number of lysine residues, generally six or less, 
5 preferably 1 — 4 lysines, they are also referred to herein as 
M LDVs 11 even though containing more lysine residues than the 
parental counterpart. 

Polypeptides of interest include both proteins and poly- 
peptides, preferably human, useful in therapeutic, prophylactic 

10 and/ or diagnostic applications, including hematopoietins such as 
colony stimulating factors, e.g. G-CSF, GM-CSF, M-CSF, CSF-1, 
Meg-CSF, erythropoietin (EPO) , IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, 
B-cell growth factor, B-cell differentiation factor, eosinophil 
differentiation factor, erythroid potentiating activity (EPA) , 

15 macrophage activating factor, HILDA, interferons and tumor 
necrosis factor, among others; thrombolytic agents such as tPA, 
urokinase (uPA) and streptokinase and variants thereof as are 
known in the art; proteins involved in coagulation and 
hemostasis, including Factor V, Factor VTI, Factor VIII, Factor 

20 IX, Factor XIII, Protein C and Protein S; proteins and 
polypeptides useful as vaccines; as well as other proteins and 
polypeptides and analogs thereof, including for example 
superoxide dismutase (SOD) (including extracellular SOD) ; growth 
hormones such as human and bovine growth hormone, epidermal 

25 growth factor, fibroblast growth factors, transforming growth 
factors TGFa and TGF0, insulin-like growth factor, PDGF, and 
ODGF; pulmonary surfactant proteins (PSPs) ; calcitonin; 
somatostatin; catalase; elastase; inhibins; angiogenic factors; 
atrial natriuretic factor; FSH, LH, FSH-releasing hormone, LH- 

30 releasing hormone and HCG; immunotoxins and immunoconjugates; 
anti-thrombin III; bone or cartilage morphogeriic factors; and CD- 
4 proteins* In order to provide additional disclosure concerning 
exemplary proteins mentioned above and their uses, the following 
published foreign applications and co-owned pending U.S. 

35 applications are hereby incorporated by reference herein: PCT 
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Nos. WO 86/00639 and WO 85/05124; and U.S. Serial Nos. 940,362; 
047,957; 021,865; and 099,938. Sequence information for other 
proteins mentioned above are also known in the art. 

Most proteins and polypeptides contain several lysine 
residues within their peptide structure. By "lysine depleted 
variant" as the term is used herein, r mean variants of proteins 
or polypeptides which are modified in amino acid structure 
relative to naturally occurring or previously known counterparts 
in one or more of the following respects: 

(i) at least one lysine residue of the natural or previously 
known compound is deleted or replaced with a substitute amine 
acid, preferably arginine; 

(ii) at least one lysine residue is inserted into the natural or 
previously known sequence and/or is used to replace a different 
amino acid within that sequence; and, 

(iii) the first amino acid at the N-terminus of te mature 
20 polypeptide is preferably proline, which is a relatively non- 
reactive amine, or is reversibly blocked with a protecting group. 

With respect to modification (i) , above, it is typically 
preferred in the case of lymphokines and other proteins of like 
molecular size that all but 1-6 of the original lysines be 
deleted and/or replaced. In general, for consistent homogeneous 
modification of the LDVs the fewer lysines remaining in the LDV 
the better, e.g. only 1—4 lysines. It. should be understood, 
however, that in certain cases LDVs containing more than -4—6 
reactive lysines may, given appropriate location and spacing of 
such lysines, be capable of homogeneous modification, e.g. 
PEGylation, and upon such modification may possess advantageous 
biological properties such as differential binding to receptors, 
antibodies or inhibitors relative to the parental protein, as 
35 discussed below, it should also be understood that in accordance 
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with modification (ii) , above, one or more additional lysine 
residues may be inserted into the natural or previously known 
sequence and/or used to replace as desired other amino acids 
therein, e.g. arginine. Thus all lysines may be deleted or 
5 replaced in accordance with (i) , and one or more new lysines may 
be inserted or used to replace a different amino acid in the 
molecule. Alternatively, all but one or two, for example, of the 
lysines in the natural or previously known sequence may be 
deleted or replaced with other amino acids, e.g; arginine. In any 

10 event, and as described in greater detail below, the LDVs of this 
invention make it possible for the first time to produce homo- 
geneous compositions containing polypeptides or proteins (LDVs) 
substantially specifically and consistently modified at selected 
positions using amine-reactive moieties (described hereinafter) 

15 as the modifying agents. 

Thus, in the practice of this invention, lysine residues are 
identified in those portions of the polypeptide where 
modification via amino-reactive moieties is not desired. The 
lysine residues so identified are deleted or replaced with 

20 different amino acids, e.g. by genetic engineering methods as 
described below. Preferably replacements are conservative, i.e. 
lysine is replaced by arginine, and where a hew lysine is to be 
introduced, arginine by lysine. Any remaining lysine residues 
represent sites where modification by amine-reactive moieties is 

25 desired. Alternatively, or in addition, novel lysine residues 
may be engineered into the polypeptide at positions where 
attachment is desired, most conveniently, for example, by simple 
insertion of a lysine codon into the DNA molecule a-t the desired 
site or by converting a desirably located arginine or other codon 

30 to a lysine codon. Convenient methods for (i) site specific 
mutagenesis or DNA synthesis for producing a DNA molecule 
encoding the desired LDV, (ii) expression in procaryotic or 
eucaryotic host cells of the DNA molecule so produced, and (iii) 
recovery of the LDV produced by such expression are also 

35 disclosed in detail below. 
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The LDVs of this invention retain useful biological 
properties of the natural or previously known polypeptide or 
protein, and may thus be used, with or without modification with 
amine-reactiv. moieties, for applications identified for the non- 
modified parent polypeptide or protein. Modification with such 
moieties, however, is preferred. such modified LDVs are 
producable in homogeneous compositions which, it is contemplated, 
will provide improved pharmacokinetic profiles and/or solubility 
characteristics relative to the parent polypeptides. 

_ in cases where the parental polypeptide normally can 
interact with one or more receptors, as in the case of n- 2 for 
•rample, it is contemplated that modified LDVs of the polypeptide 
Wherein the modification masks one or more receptor binding sites 

ilratT,t a * 9 ' Wlt!l ^ ^ °* ltS I-- not 

S£V " ^ tyPSS ° f «hich interact 

with the .parental polypeptide, such modified LDVs may represent 
therapeutic agents having more specific biological and pharmaco- 

c Zl *** f e Polypeptide, in 

Z PSrental f 01 ""*"** normally interact with 

modif » M *" *"* ° £ tt iB ^templated that 

mc^fied LDVs of such polypeptides or proteins wherein the 
modification masks an iittibitor binding sit. may have a reduced 
or substantially, abolished interaction with the inhibitor^ 
thus improved utility as a therapeutic agent, in cases where the 
wise" »»*•*» can elicit neutralizing or other- 

wise inhibitory antibodies in humans, as in the case of Factor 
VIII, modified LDVs wherein the modification masks the epitope 
f SUeh «»«*o«" may represent the first potential 
therapeutic, and indeed, life saving, agents. Finely. Jn.r 

% szsz^jrrrr or otherwise 

.,. , , ° ut ""y °C a protein, as in the case of the 

APe cleavage site in Factor VIII or the proteolytic cleavage site 
in prourokinase which liberates the kringle region from Z 
serine protease domain, modified LDVs of the protein wherein the 
modification masks the cleavage site may represent potential 



WO 89/05824 PCT/US88/04633 

therapeutic agents with longer effective in vivo half life or 
other improved properties relative to the parental protein. 

Biological activity of the LDVs before or after modification 
with the amine-reactive moieties may be determined by standard in 
5 vitro or in vivo assays . conventional for measuring activity .of 
the parent polypeptide. 

Selective and homogeneous modification of the LDVs with 
amine-reactive moieties is possible since such moieties will 
covalently bond only to * -amino groups of the remaining lysine 
10 residue (s) in the LDVs and to the amino terminus of the LDV, if 
reactive. The modified LDVs so produced may then be recovered, 
and if desired, further purified and formulated with into 
pharmaceutical compositions by conventional methods. 

It is contemplated that certain polypeptides or proteins may 
15 contain one or more lysine residues, which by virtue of peptide 
folding or glycosylation, for example, are not accesible to 
reaction with amine-reactive moieties, except under denaturing 
conditions. In the practice of this invention such non-reactive 
lysine residues may be, but need not be, altered since they will 
20 not normally be susceptible to non-specific modification by 
amine-reactive moieties. The presence in parental polypeptides 
or proteins of non-reactive lysine residues may be conveniently 
determined, if desired, by modifying the parental polypeptide or 
protein with an amine-reactive compound which results in the 
25 attachment to reactive lysines of a modifying moiety of known 
molecular weight under denaturing and non-denaturing conditions , 
respectively, and determining, e.g. by SDS-PAGE analysis, the 
number of attached moieties in each case. The presence and 
number of additional attached moieties on the denatured parental 
30 polypeptide relative to the non-denatured parental polypeptide is 
a general indication of the presence and number of non-reactive 
lysine residues. The locations of any such non-reactive lysine 
residues may be determined, e.g. by SDS-PAGE analysis of 
proteolytic fragments of the polypeptide modified under 
35 denaturing and non-denaturing conditions. Lysine residues which 
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are modified sometimes but not always under the reaction 
conditions selected for the practice of this invention are deemed 
reactive lysine residues for the purpose of this disclosure. 

Amine-reactive moieties include compounds such as succinic 
5 anyhydride and polyaikylene glycols, e.g. polyethylene and 
polypropylene glycols, as well as derivatives thereof, with or 
without coupling agents or derivatization with coupling moieties, 
e.g. as disclosed in U.S. Patent No. 4,179,337; published 
European Patent Application No. 0 154 316; published 
10 International Application No. WO 87/00056; and Abuchowski and 
Davis, in "Enzymes as Drugs" (1981), Hokenberg & Roberts, eds. 
(John Wiley & Sons, NY) , pp. 367-383. 

Generally, the method for modifying the LDVs can be depicted 
as follows: 



15 



20 



25 



30 



35 



(Lys) n + >n(Y-Z) > (Lys) n 

Y 

wherein " « represents the polypeptide backbone of the LDV, 

"Lys" represents a reactive lysine residue within the polypeptide 
sequence, «y- Z " represents the amine-reactive moiety, «Y« 
represents a hydrophilic moiety which becomes covalently linked 
to the c -amino group of the lysine residue (s) in the course of 
the depicted reaction; and "n" is an integer. 

Briefly, the method comprises reacting the LDV with an amine 
reactive compound under suitable conditions, preferably non- 
denaturing conditions, and in sufficient amounts permitting the 
covalent attachment of the hydrophilic moiety to lysine 
residue(s) present in the polypeptide backbone of the LDV. 
Generally, the amount of amine-reactive compound used should be 
at least eguimolar to the number of lysines to be derivatized, 
although use of excess amine-reactive compound is strongly 
preferred, both to improve the rate of reaction and to insure 
consistent modification at all reactive sites. The modified LDV 
so produced, may then be recovered, purified and formulated by 
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conventional methods. See e.g., WO 87/00056 and references cited 
therein 

While any polypeptide is a candidate for the method of the 
invention, presently desirable polypeptides to be homogeneously 
5 modified include lymphokines and growth factors. Of significant 
interest are those polypeptides which affect the immune system, 
including the colony stimulating factors, and other growth 
factors. 

Other aspects of the present invention include therapeutic 
10 methods of treatment and therapeutic compositions which employ 
the modified polypeptide LDVs of • the present invention. These 
methods and compositions take advantage of the improved 
pharmacokinetic properties of these modified LDVs to provide 
treatments, e.g., such as employing lower dosages of polypeptide, 
15 less frequent administration, and more desirable distribution, 
required for the therapeutic indications for the natural 
polypeptide. 

Other aspects and advantages of the present invention will 
be apparent upon consideration of the following detailed 
20 description of the invention, including illustrative examples of 
the practice thereof. 

BRIEF DESC RIPTION OF THE} DRAWINGS 

25 Fig. 1 is the polypeptide sequence of IL-2 , with amino acid 
numbers used for reference in the specification. 

Fig. 2 is the polypeptide sequence of IL-3 , with amino acid 
numbers used for reference in the specification. 

Fig. 3 is the polypeptide sequence of IL-6, with amino acid 
30 numbers used for reference in the specification. 

Fig. 4 is the polypeptide sequence of G-CSF,with amino acid 

numbers used for reference in the specification. 

Fig. 5 illustrates synthetic oligonucleotides for the preparation 

of synthetic DNA molecules encoding exemplary IL-2 LDVs of the 
35 invention; odd numbered oligonucleotides correspond to sequences 
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within sense strands, even numbered oligonucleotides to anti- 
sense strands; the initiation ATG is marked with ••***» and 
altered codons are underlined; oligonucleotides in Pig. 5A yield 
the Ldv with alanine at position 125 and oligonucleotides in Fig. 
5 5B yield the LDV with cystein at position 125. 

DETAILED PES CRTPTT n w pp tor TWVPMnnv^ 

The present invention involves the selective modification of 
polypeptides of interest for pharmaceutical use, to both enhance 
10 thexr pharmacokinetic properties and provide homogeneous 
compositions for human therapeutic use. Any polypeptide is 
susceptible to use in the method of the invention. Most 
desirably, a polypeptide having one or more lysine residues in 
its ammo acid sequence, where it would be desirable to attach an 

h!vf " aCtiVS COmPOUnd ' * ay be -Ployed. Also polypeptides 
having arginine residues which may be converted to lysine 
residues for such attachments may be employed. Lysine residues 
*ay also or alternatively be inserted into, or used to replace ■ 
^ endogenous amino acid residues, in a polypeptide a sequence which 

rtnalT T V : niently loCated ^ine or arginine residues. 
Finally, lysine residues may be used to replace asparigine 

sites 6 " I , *!r d f" * C ° nSenSUS N - linked Sanation 

sites, m the latter case, the LDVs, even when expressed in 

25 ll \ C6llS ref ° lded if Hecessa ^ or desired) , may be 

o^Zl ^ V "* diSClOSed herein at ° ne « -re locations 
otherwxsed glycosylated when expressed in eukaryotic cells. 

The method for selectively modifying the polypeptide of 

for^eTT SeleCtlng l0Cati ° nS ^ Polypeptide sequence 
30 I ^tachment of amine reactive compounds. This step may be 

toZZlTl" ^ the -id sequence'of'the 

polypeptide by converting selected lysine residues into arginine 
rescues, or converting selected arginine residues into lysine 
residues. For example, the codons AAA or AAG, which code for 

5 CGG wliT ^ Chan9ed t0 ^ COd ° nS AGA ' AGG, C6A, CGT, CGC, or 
WhlCh COde for arginine, and vice versa. Alternatively, 
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lysine residues may be inserted into and/or deleted from a 
peptide sequence at a selected site(s) . 

LDVs in accordance with this invention also include proteins 
with allelic variations , i . e . sequence variations due to natural 
5 variability from individual to individual , or with other amino 
acid substitutions or deletions which still retain desirable 
biological properties of the parental protein or polypeptide. 

All LDVs of this invention may be prepared by expressing 
recombinant DNA sequences encoding the desired variant in host 

10 cells, e.g. procaryotic host cells such as E. coli, or eucaryotic 
host cells such as yeast or mammalian host cells, using methods 
and materials, e.g. vectors, as are known in the art. DNA se- 
quences encoding the variants may be produced synthetically or by 
conventional site-directed mutagenesis of DNA sequences encoding 

15 the protein or polypeptide of interest or analogs thereof. 

DNA sequences encoding various proteins of interest have 
been cloned and the DNA sequences published. DNA sequences 
encoding certain proteins of interest have been deposited with 
the American Type Culture Collection (See Table 1) . DNA 

20 molecules encoding a protein of interest may be obtained (i) by 
cloning in accordance with published methods , (ii) from deposited 
piasmids, or (iii) by synthesis, e.g. using overlapping synthetic 
oligonucleotides based on published sequences which together span 
the desired coding region. 

25 As mentioned above, DNA sequences encoding individual LDVs 

of this invention may be produced synthetically or by 
conventional site-directed mutagenesis of a DNA sequence encoding 
the parental protein or polypeptide of interest or analogs 
thereof . Such methods of mutagenesis include the M13 system of 

30 Zoller and Smith, Nucleic Acids Res. 10.:6487 - 6500 (1982); 
Methods Enzymol. 100:468-500 (1983); and DNA 3:479-488 (1984), 
using single stranded DNA and the method of Morinaga et al., 
Bio/ technology, 636-639 (July 1984), using heteroduplexed DNA. 
Exemplary oligonucleotides used in accordance with such methods 

35 are described below. It should be understood, of course, that 



WO 89/05824 



PCT/US88/04633 



12 



DNA encoding each of the LDVs of this invention may be 
analogously produced by one skilled in the art through 
site-directed mutagenesis using appropriately chosen 
oligonucleotides. 

5 The new. DNA sequences encoding the LDVs of this invention 

can be introduced into appropriate vectors for heterologous 
expression in the desired host cells, whether procaryotic or 
eucaryotic. The activity produced by the transiently transfected 
or stably transformed host cells may be measured by using 
10 standard assays Conventional for the parental protein. 

The LDV produced by expression in the genetically engineered 
host cells may then be purified, and if desired formulated into 
pharmaceutical compositions by conventional methods, often 
preferably by methods which are typically used in purifying 
15 and/or formulating the parental protein. It is contemplated that 
such pharmaceutical compositions containing the LDV in admixture 
with a pharmaceutical^ acceptable carrier will possess similar 
utilities to those of the parental proteins. 

in another, and preferred, aspect of this invention, the 
LDVs produced by recombinant means as mentioned above- are reacted 
with the desired amine-reactive compound under conditions 
permitting attachment of the compound to the £ -amino groups at 
remaining lysine residues in the peptide backbone of the LDV. 

The term "amine reactive compound" is defined herein as any 
compound having a reactive group capable of forming a covalent 
attachment to the Epsilon amine group of a lysine residue, 
included, among such compounds are hydrophllic polymers such as 
PEG and polypropylene glycol (PPG) ; compounds such as succinic 
anhydride; and others. Methods for such attachment are 
conventional, such as described in PCT application WO97/00056 and 
references described therein. However, by controlling the number 
and location of the remaining lysines in the LDV sequence, the 
number and location (s) of the attached moiety can be selectively 
controlled. Such control of attachment location and number 
enables the production of only certain selectively modified 
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molecules retaining the desired biological activity , rather than 
production of a heterogeneous mixture of variably modified 
molecules, only some of which may be active. 

Another aspect of the invention is therefore homogeneous 
5 compositions of modified LDVs as described herein, e.g. PEGylated 
LDVs. Specific embodiments of polypeptide LDVs of the invention 
include IL-2 which has arginine residues replacing lysine 
residues at one or more of the lysine residues at positions 8, 9, 
32, 35, 43, 48, 49, 54, 64, 76, and 97. A presently desirable 
10 example of such a modified XL- 2 has the natural lysine residue 
only at position 76, with all oth4r lysine residue positions as 
identified above being replaced by arginine residues and with 
lysine 76 being coupled to PEG. Amino acid numbers correlate 
with the numbering system used in Pig. 1 for the appropriate 
15 unmodified peptides. 

Similarly, one or more of the naturally occurring lysine 
residues in IL-3 (Fig, 2) at amino acid positions 10, 28, 66, 79. 
100, 110 and 116 may be converted to a suitable amino acid, such 
as arginine, to create a polypeptide LDV of the invention. For 
20 example, one such polypeptide has positions 10, 28, 100, 110 and 
116 converted to arginine and the remaining lysine residues at 
positions at 79 and 66 coupled to PPG. Alternatively one or more 
of the arginine residues may be converted to lysine residues. 
Table 2 below illustrates the positions and amino acid numbers of 
25 lysine and arginine residues in several exemplary polypeptides 
which can be altered according to the invention. The position 
numbers correspond to the appropriate figures 1 through 4 . In 
the case of EPO, it may be desirable to replace all but one to 
about four of the endogenous lysine residues (positions 20, 45, 
30 52, 97, 116, 140, 152 and 154) with arginine residues and/or to 
convert one or more of the endogenous arginine residues to lysine 
residues, especially at positions 4 and/or 10 and/or 162. 

Other modified peptides may be selected and produced in 
accordance with this invention as described for the above 
35 peptides, which are included as examples only. 
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Table 1: DNA encoding exemplary proteins of interest 
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35 



protein 

G-CSF 

GM-CSF 

M-CSF 

CSF-1 

IL-2 

IL-3 



vector & ATCC accession * 
PXMT2G-CSF (67514) 
pCSF-1 (39754) 
P3ACSF-69 (67092) 



IL-6 
tPA 
FVTII 
15 ATIII 
SOD 
EPO 



references 

(1) 
(2) 

(4) 

pBR322-aTCGF (39673) (6) 

pCSF-MLA (67154); CSF-16 (40246); (7) 
pHuIL3^2 (67319); pSHTL-3-1 (67326) 

PCSF309 (67153) ;pAL181 (40134) 

piVPA/1 (39891) ; J205 (39568) 

PSP64-VII1 (39812) ;pDGR-2 (53100) 

P91023 AT III-C3 (39941) 

RXFL13 (39989) 



(8) 

(9) 
(10) 

(11) 
(12) 
(13) 
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2. WO 86/00639; Wong'et al., Science 

3. WO 87/06954 

4> Kawasaki et al., 1985, Science 110:291-296 

6. US Serial Ho. 849,234 (filed April 6, 1986) 

7. PCT/US87/01702- ; 

8 . PCT/DS87/01611 • 
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Amine-reactive compounds will typically also react with the 
amino terminus of a polypeptide under the conditions described 
above, so long as the amino terminus is accessible to amine- 
reactive agents (i.e. reactive) and is not blocked. Therefore an 
5 alternatively modified polypeptide may be provided by blocking 
the reactive site oh the amino terminus of the selected polypep- 
tide LDV before reacting the LDV with the desired amine-reactive 
compound. Unblocking the N-termirius after the modifying moiety, 
e.g. polymer, has been covalently linked to LDV lysines will 

10 produce a modified polypeptide with polymer or other modifying 
moiety attached to the remaining lysines in the amino acid 
sequence of the LDV, but not at the amino terminus. Thus, 
compositions of polypeptides homogeneous for polymer attachment 
or lack of polymer attachment at the amino terminus are also 

15 encompassed by this invention. Additionally, for bacterial 
expression where the secretory leader-encoding DNA sequence is 
removed from the LDV-encoding DNA, it may be desirable to 
additionally modify the sequence such that it exrcodes an N-- 
terminus comprising Met-Pro instead of other N-termini such as 

20 Met-Ala-Pro. Such N-terminal modification permits more 
consistent removal of the N-terminal methionine. 

Thus, LDVs of this invention, modified as described, 
encompass LDVs containing other modifications as well, including 
truncation of the peptide sequence, deletion or replacement of 

25 other amino acids, insertion of new N-linked glycosylation sites, 
abolishment of natural N-linked glycosylation sites, etc. Thus, 
this invention encompasses LDVs encoded for by DNA molecules 
which are capable of hybridizing under stringent conditions to 
the DNA molecule encoding the parental polypeptide or protein so 

30 long as one or more lysine residues of . the parental peptide 
sequence is deleted or replaced with a different amino acid 
and/or one or more lysine residues are inserted into the parental 
peptide sequence and the resulting LDV is covalently modified as 
described herein. 

35 
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Because the method and compositions of this invention 
provide homogeneous modified polypeptides, the invention also 
encompasses such homogeneous compositions for pharmaceutical use 
which comprise a therapeutically effective amount of a modified 
5 LDV described above in a mixture with a pharmaceutically 
acceptable carrier. Such composition can be used in the same 
manner as that described for the natural or recombinant 
polypeptides. It is contemplated that the compositions will be 
used for treating a variety of conditions. For example, a 

10 modified IL-2 may be used to treat various cancers. A modified 
G-CSF can be used to treat neutropenia, eJg., associated with 
chemotherapy. A modified EPO may be used for treating various 
anemias. The exact dosage and method of administration will be 
determined by the attending physician depending on the 

15 particular modified polypeptide employed, the potency and 
pharmacokinetic profile of the particular compound as well as on 
various factors which modify the actions of drugs, for example, 
body weight, sex, diet, time of administration, drug combination, 
reaction sensitivities and severity of the particular case. 

20 Generally, the daily regimen should be in the range of the dosage 
for the natural or recombinant unmodified polypeptide, e.g. for a 
colony stimulating factor such as G-CSF, a range of l-ioo 
micrograms of polypeptide per kilogram of body weight. 
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The therapeutic method and compositions of the present 
invention may also include co-administration with other drugs or 
human factors. The dosage recited above would be adjusted to * 
5 compensate for such additional components in the therapeutic com- 
position or regimen. In the case of pharmaceutical compositions f 
containing modified lymphokine LDVs, for example, progress of the 
treated patient can be monitored by periodic assessment of the 
hematological profile, e.g. white cell count, hematocrit and the 
10 like. 

The following examples illustrate the method and 
compositions of the invention. 



20 



EXPERIMENT&T, MATERIALS. METHODS AND EXftMPUIS 

15 Sxample 1; Eucarvotic Expressi on Materials and Methods 

Eukaryotic cell expression vectors into which DNA sequences 
encoding LDVs of this invention may be inserted (with or without 
synthetic linkers, as required or desired) may be synthesized by 
techniques well known to those skilled in this art. The compon- 
ents of the vectors such as the bacterial replicons, selection 
genes, enhancers, promoters, and the like may be obtained from 
natural sources or synthesized by known procedures. See Kaufman 
et al " J v . Mol « Bloy, , 15£:60l-621 (1982); Kaufman, Proc . 
Natl. Acad. Snj. 51:689-693(1985). See also.WO 87/04187 (pMT2 
and pMT2— ADA) and US Serial No. 88,188, filed August 21, 
1987) (pxMT2) . Exemplary vectors useful for mammalian expression 
are also disclosed in the patent applications cited in Example 4, 
which are hereby incorporated by reference. Eucaryotic 
expression vectors useful in producing variants of this invention 
may also contain inducible promoters or comprise inducible 
expression systems as are known in the art. See US Serial No. 
893,115 (filed August 1, 1986) and PCT/US87/01871. - 

Established cell lines, including transformed cell, lines, 
are suitable as hosts.. Normal diploid cells, cell strains 
derived from in vitro culture of primary tissue, as well as 



25 



30 



35 
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primary explants (including relatively undifferentiated cells 
such as haematopoetic stem cells) are also suitable. Candidate 
cells need not be genotypically deficient in the selection gene 
so long as the selection gene is dominantly acting. 
5 The host cells preferably will be established mammalian cell 

lines. For stable integration of the vector DNA into chromosmal 
DNA, and for subsequent amplification of the integrated vector 
DNA, both by conventional methods , CHO (Chinese Hamster Ovary) 
cells are presently preferred. Alternatively, the vector DNA may 

10 include all or part of the bovine papilloma virus genome (Lusky 
et ai. , Cell . 36 : 391-401 (1984) and be carried in cell lines 
such as C127 mouse cells as a stable episomal element. Other 
usable mammalian cell lines include HeLa, COS-1 monkey cells, 
melanoma cell lines such as Bowes cells, mouse L-929 cells, 3T3 

15 lines derived from Swiss, Balb-c or NIH mice, BHK or HaK hamster 
cell lines and the like. 

Stable transformants then are screened for expression of the 
LDV product by standard immunological or activity assays. The 
presence of the DNA encoding the LDV proteins may be detected by 

20 standard procedures such as Southern blotting. Transient expres- 
sion of the procoagulant genes during the several days after 
introduction of the expression vector DNA into suitable host 
cells such as COS-1 monkey cells is measured without selection by 
activity or immunologic assay of the proteins in the culture 

25 medium. 

Following the expression of the DNA by conventionail means, 
the variants so produced may be recovered, purified, and/or 
characterized with respect to physiochemical , biochemical and/or 
clinical parameters, all by known methods. 

30 

Example 2 s Bacterial and Yeast expression 

Bacterial and yeast expression may be effected by inserting 
(with or without synthetic linkers, as required or desired) the 
DNA molecule encoding the desired LDV into a suitable vector (or 
35 inserting the parental DNA sequence into the vector and 
mutagenizing the sequence as desired therein) , then transforming 
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the host cells with the vector so produced using conventional 
vectors and methods as are known in the art, e.g. as disclosed in 
published PCT Application No. WO 86/00639. Transformahts are 
identified by conventional methods and may be subcloned if 
desired. Characterization of transf ormants and recombinant 
product so produced may be effected and the product recovered and 
purified, all as described in Example 1. 

For bacterial expression, the DNA sequences encoding the 
LDVs are preferably modified by conventional procedures to encode 
only the mature polypeptide and may optionally be modified to 
include preferred bacterial codons. 

Where the LDV comprises lysine residues at one or more 
locations otherwise occupied in the native sequence by consensus 
N-linked glycosylation sites or by an O-linked glycosylation 
site, modification (e.g. PEGylation) of the bacterial (or other) 
expression product (refolded if necessary or desired) results in 
a polypeptide more closely mimicing the corresponding native 
glycosylated eucaryotic expression product., 

Example 3; Insect Cell Expression 

Similarly, expression of the recombinant LDVs may be 
effected in insect cells, e.g. using the methods and materials 
disclosed therefor in published European Applications Nos. 0 155 
476 Al or 0 127 839 A2 and in Miller et al . , Genetic 
Engineering, Vol.8, pp. 277-2 98 (J.K. Setlow and A. Hollander, 
eds., Plenum Press, 1986); Pennock et al., 1984, Hoi. Cell. Biol. 
4_: (3)399-406; or Maeda et al., 1985, Nature 3J£: 592-594. 

Example 4t Mutagenesis Protocol 

Site directed mutagensis may be effected using conventional 
procedures known in the art. See e.g. published International 
Applications Nos. WO 87/07144 and WO 87/04722 and US Serial Nos. 
.099,938 (filed September 23, 1987) and 088,188 (filed August 21, 
1987) and the references cited therein. 
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Example 5; Exemplary Oligonucleotides for M utagenesis Reactions 

The following oligonucleotides were designed for the 
indicated exemplary mutagenesis reactions: 

5 JL. " sequence modification 

[ IL-2 K >R at position:] 

1 CA AGT TCT ACA AGG AAA ACA CAG C 8 

2 GT TCT ACA AAG MA ACA CAG CTA C 9 

3 GGA AAT AAT TAC AGS AAT CCC AAA C 32 
10 4 C AAG AAT CCC AGA CTC ACC AGG ATG C 35 

5 G CTC ACA TTT AGG TTT TAC ATG CCC 43 

6 G TTT TAC ATG CCC AGG AAG GCC ACA GAA C 48 

7 G TTT TAC ATG CCC AAG AGG GCC ACA GAA C 49 

8 GCC ACA GAA CTG AGA CAT CTT CAG TG 54 
15 9 GAA GAA GAA CTC AGA CCT CTG GAG G 64 

10 GCT CAA AGC AGA AAC TTT CAC TTA AG 76 

11 GTT CTG GAA CTA MS GGA TAT GAA AC 97 

R >K at position: ' * 

12 CCC AAA CTC ACC AAG ATG CTC ACA TTT 38 
20 13 C TTT CAC TTA AAA CCC AGG GAC 81 

14 CAC TTA AGA CCC AAG GAC TTA ATC AGC 83 

15 GAA TTT CTG AAC AAA TGG ATT ACC TTT TG 120 

f G-CSF K >R at position: ] 

16 GC TTC CTG CTC AGg TGC TTA GAG C 16 
25 17 G CAA GTG AGG AGG ATC CAG GGC G 23 

18 GCG CTC CAG GAG AGG CTG TGT GCC ACC 34 

19 GT GCC ACC TAC AGG CTG TGC CAC CCC 40 

R- >K at position:] 

20 GC TTA GAG CAA GTG AAG AAG ATC CAG GGC 22 
30 21 CT GCT TTC CAG AAA CGG GCA GGA GGG 146 

22 GCT TTC CAG CGC AAG GCA GGA GGG GTC C 147 

23 GAG GTG TCG TAC AAG GTT CTA CGC CAC C 166 

24 C CGC GTT CTA AAG CAC CTT GCC CAG CCC 169 



35 In the exemplary oligonucleotides depicted above regions 

designed to effect a codon alteration are underlined. It should 
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be understood of course that the depicted list of oligonucleo- 
tides is merely exemplary and not exclusive. The design and 
synthesis of alternative and additional oligonucleotides for 
mutagenesis in accord with this invention is well within the ? 
5 present skill in the art given the benefit of this disclosure. 

Synthesis of such oligonucleotides may be conveniently * 
effected using conventional automated DNA synthesis equipment and 
methods, typically following the manufacturer's instructions. 

One skilled in the art, of course, could readily design and 

10 sythesize other oligonucleotides for deletion of lysine codons or 
insertion thereof in DNA sequences encoding IL-2 or G-CSF. 
Additionally, one could also readily design and synthesize other 
oligonucleotides for similar mutagenesis of DNA encoding any 
desired protein or polypeptide for use in the production of LDVs 

15 of this invention. To modify more than one site mutagenesis may 
be carried out iteratively, or in some cases using an oligonucle- 
otide designed for mutagenesis at more than one site. For exam- 
ple, to modify a DNA molecule encoding IL-2 to encode R-48, R-49 
IL-2 one may mutagenize the parental DNA molecule iteratively 

2D- using oligonucleotides 6 and 7, depicted above. Alternatively, 
one could mutagenize with the following oligonucleotide: 
G CTC ACA TTT AAG TTT TAC ATG CCC AGG AGG - 
GCC ACA GAACTG AAA CAT CTT CAG 
which is designed to effect both mutagenesis reactions. 

25 By way of example, one may readily produce a DNA molecule 

and express it to yield one of the following G-CSF LDVs: 

Exemplary G-CSF LDVs ' 



30 


1. 


R-16 


G-CSF 




9. 




2. 


R-23 


G-CSF 




10. 




3. 


R-34 


G-CSF 




11. 




4. 


R-40 


G-CSF 




12. 




5. 


R-16, 


R-23 


G-CSF 


13. 


35 


6. 


R-16, 


R-34 


G-CSF 


14. 




7. 


R-16, 


R-40 


G-CSF 


15. 




8. 


R-23, 


R-34 


G-CSF 





R-23, R-40 G-CSF 

R-34, R-40 G-CSF 

R-16, R-23, R-34 G-CSF 

R-16, R-34, R-40 G-CSF . 

R-23, R-34; R-40 G-CSF" 

K-169, R-16, R-23, R-34, R-40 G-CSF' , 

R-16, R-34, K-147 G-CSF 
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Modification by methods described herein of such G-CSF LDVs, 
for example, provides the following exemplary modified G-CSF 
LDVs: 

[— K R-K K K — R-R R~ R ] 

5| ' 

— | R R-R R R R-R R— R 



10 



25 



I 

, j R-R R R -~ R-R R — R- 

I I 

| 1 -R R R -R-R -R — R- 



I I 

15 R R-R 1 1 R-R R~R- 

I I 

R R-R R R 1 - j R — R- 

20 | 

R R-R >R R R-R 1 — R- 



-R-R R R R-R- 



I I 

-R-R R R —R-R 1 — | 



-30 R R-R R R R-R R — R= |- 

I I I 
R R- I R | R- | ~R—R ■- 

35 

wherein j represents a modification in accordance with this 
invention, e.g. PEGylation, at each reactive lysine residue in 
the LDV. The parental peptide sequence of G-CSF is depicted 

40 schematically at the top in brackets indicating the relative 
locations of positions 16, 23, 34 and 40 (occuppied by lysine 
residues in G-CSF) and 22, 146, 147, 166 and 169 (occupied by 
arginine residues in G-CSF). As depicted schematically above, 
all lysines not intended as potential attachment sites were 

45 replaced with arginine. It should be understood of course, that 
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as previously mentioned, lysines not intended as potential 
attachment sites' may be replaced with other amino acids as well, 
or simply deleted, and one or more additional lysine residues may 
be added by insertion between or replacement of amino acid of the 
5 parental peptide sequence. 

Example fir synthe sis of DNA molecules encoding LDVs 

As an alternative to the production of LDV-encoding DNA by 
10 mutagenesis of a parental DNA sequence, it should be understood 
that the desired LDV-encoding DNA may be prepared synthetically. 
In that case, it will usually be desirable to synthesize the DNA 
in the form of overlapping oligonucleotides, e.g. overlapping 50- 
80mers, which together span the desired coding sequence: 



15 



20 



30 



Given a desired coding sequence, the design, synthesis, assembly 
and ligation, if desired, to synthetic linkers of appropriate 
oligonucleotides is well within the persent level of skill in the 
art. 

Example 7; Preparat ion of PEG-vlated IL-2 U>v 

25 a. DNA encoding the LDV 

A DNA molecule encoding IL-2 containing arginine residues in 
place of lysine residues at positions 8, 9, 32, 35, 43, 48, 54, 
64 9 * (and alanine in place of cystein at position 125) is 

synthesized as follows. The oligonucleotides depicted in Fig. 5A 
are synthesized by conventional means using a commercial 
automated DNA synthesizer following the supplier's instructions. 
Odd numbered oligonucleotides in Fig. 5 are "sense" strands, even 
numbered oliognucleotides are "anti-sense" strands. Oligonucleo- 
tides l and 2, 3 and 4, 5 and 6, 7 and 8, 9 and 10, 11 and 12, 13 
35 and 14 and 15 and 16 are annealed to each other, respectively, 
under conventional conditions, e.g. lOmM tris, PH 7.5, 20mM NaCl, 
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2mM MgCl2f and 10pM (combined oligonucleotides) /A of solution, 
with heating to 100 C followed by slow cooling over -2 h to 37 C. 
The eight mixtures are then combined and the duplexes were 
ligated to one another under standard conditions, e.g. 50mM tris, 
pH 7.4, lOmM MgCl 2/ lOmM DTT, and 1 mM ATP and 5 Weiss units of 
T4 ligase (New England BioLabs) at room temperature overnight 
(-16 h) . The mixture is electrophoresed through a 1% low gelling 
temperature agarose gel and a band of 480 bp was excised from the 
gel. That DNA molecule so produced encodes an Ala-125 IL-2 
having the K — >R mutations indicated above on an EcoRI/XhoI 
cassette. 

b. insertion into vector, expression and modification of the LDV 

The EcoRI/Xho I cassette may then be inserted into any 
desired vector, e.g. pxMT2 or derivatives thereof, using 
synthetic linkers as desired or necessary. Transformation of 
mammalian cells, e.g. COS or CHO cells, selection of 
transformants, amplification of gene copy n umb er in the case of 
CHO transformants, and culturing of the cells so obtained to 
produce the desired LDV, may be readily effected by conventional 
methods, such as those disclosed in the references in Table 1, 
above. The protein so produced may be recovered and further 
purified if desired, and PEGylated, and the PEGylated protein 
purified all by conventional methods. 

Example 8; Preparation of Alternative PEGylated IL-2 LDV 

Example 8 may be repeated using the oligonucleotides 
depicted in Pig. 5B in place of those depicted in Fig. 5A. The 
DNA molecule so produced encodes an LDV identical to that in 
Example 8, except that cystein at position 125 is retained. The 
corresponding PEGylated IL-2 LDV is thus produced. 

Example 9: — preparation of FEG-ylated R-16, R-34, K-147.G-CSF IflV 
pxMT2G-CSF may be mutagenized by conventional procedures 
using oligonucleotides 16, 18 and 22 depicted in Example 5 to 
produce a pxMT2G-CSF derivative encoding the title G-CSF LDV. 



WO 89/05824 



PCT/US88/04633 



26 



Transformation of mammalian cells, e.g. COS or CHO cells, 
selection of transf ormants , amplification of gene copy number in 
the case of CHO transf ormants, and culturing of the cells so 
obtained to produce the desired LDV, may be readily effected by 
conventional methods, such as those disclosed in the references 
in Table 1, above. The protein so produced may be recovered and 
further purified if desired, PEGylated by conventional procedures 
and the PEGylated protein purified by standard methods. 

The same or similar procedures may be used by one skilled in 
the art to attach polymers such as PEG or PPG or other moieties 
preferably hydrophilic moieties, to the other LDVs of the 
xnvention. Homogeneiety can be observed by conventional analysis 
of the modified LDVs so produced e.g. using standard SDS-PAGE or 
HPLC analysis. 



Numerous modifications may be made by one skilled in the art 
to the methods and compositions of the present invention in view 
of the disclosure herein, such modifications are believed to be 
encompassed by this invention as defined by the appended claims. 
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What is claimed is: 

1. A lysine depleted variant ( w LDV fl ) of a lymphokine, growth 
factor , hormone or vaccination agent having biological activity 
characterized by the deletion of , or amino acid substitution for, 
at least one lysine residue; and/or the insertion of a lysine 
residue within the polypeptide sequence and/or the replacement of 
a different amino acid within the polypeptide sequence with a 
lysine residue. 

2. An LDV of claim 1, wherein the amino acid substitution for 
lysine comprises the substitution of arginine for lysine, and/or 
the replacement of amino acid(s) with lysine comprises the 
replacement of at least one arginine with lysine. 

3. An LDV of claim 1 which contains 1-6 lysine residues* 

4. An LDV of claim 3 which contains 1-4 lysine residues. 

5. A lymphokine LDV of claim 1, wherein the lymphokine is IL-1, 
IL-2, IL-3, IL-4, IL-5, IL-6, G-CSF, M-CSF, GM-CSF or EPO. 

6. A DMA molecule encoding an LDV of claims 1-5. 

7. A procaryotic or eucaryotic host cell containing a DNA 
molecule of claim 6 in operable association with a transcription 
control sequence permitting expression of the DNA molecule and 
production of the LDV. 

8. A method for producing an LDV of a protein or polypeptide 
having biological activity characterized by the deletion of, or 
amino acid substitution for, at least one lysine residue; and/ or 
the insertion of a lysine residue within the polypeptide sequence 
and/or the replacement of a different amino acid within the 
polypeptide sequence with a lysine residue, which method 
comprises culturing a procaryotic or eucaryotic host cell 
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containing and capable of expressing a DNA molecule encoding the 
LDV under suitable conditions permitting production of the LDV. 

9. A modified LDV, wherein each lysine residue of the 
polypeptide sequence of the LDV is linked to a hydrophilic moiety 
by covalent linkage of the e-amino group of each lysine residue 
present within the polypeptide sequence of the LDV to a 
hydrophilic moiety selected from the group consisting of a 
polyalkylene glycol and succinic anhydride. 

10. A method for producing a homogeneous composition containing 
a modified LDV of claim 9 of the formula: 



(Lys) n - 



wherein « » represents the polypeptide backbone of the LDV 

Lys» represents a lysine residue within the polypeptide 
sequence, »y« represents a hydrophilic moiety covalently linked 
to the .-amino group of the lysine residue (s ) ; and «n« is an 
integer, the method comprising reacting the LDV with an amine 
11. Ct 1 1 : e i COffiP ° Und selected from the group consisting of a 
Polyalkylene glycol and succinic anhydride under suitable 
conditions, and in sufficient' amounts permitting the covalent 

IH^T*? hydr °^ ilic to each lysine residue 

present in the polypeptide backbone of the LDV. 

11. A method of claim 10 which further comprises recovering and 
purifying the modified LDV so produced. 

12. A modified LDV produced according to the method of claim 11. 

13. -A pharmaceutical composition containing a therapeutically 
effective amount of a modified LDV of claim 12 and .* 
pharmaceutical^ acceptable carrier. 
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FIG. 1 

5 ' TCTCTTTAATCACTACTCACAGTAACCTCAACTCCTGCCACA 
-20 -10 

Met Tyr Arg Met Gin Leu Leu Ser Cys lie Ala Leu Ser Leu Ala Leu 
ATG TAC AGG ATG CAA CTC CTG TCT TGC ATT GCA CTA AGT CTT GCA CTT 
50 

1 10 
Val Thr Asn Ser Ala Pro Thr Ser Ser Ser Thr Lys Lys Thr Gin Leu 
GTC ACA AAC AGT GCA CCT ACT TCA AGT TCT ACA AAG AAA ACA CAG CTA 
100 

20 

Gin Leu Glu His Leu Leu Leu Asp Leu Gin Met lie Ser Asn Gly lie 
CAA CTG GAG CAT TTA CTT CTG GAT TTA CAG ATG ATT TCG AAT GGA ATT 
150 

30 40 
Asn Asn Tyr Lys Asn Pro Lys Leu Thr Arg Met Leu Thr Phe Lys Phe 
AAT AAT TAC AAG AAT CCC AAA CTC ACC AGG ATG CTC ACA TTT AAG TTT 
200 

50 60 
Tyr Met Pro Lys Lys Ala Thr Glu Leu Lys His Leu Gin Cys Leu Glu 
TAC ATG CCC AAC AAG GCC ACA GAA CTG AAA CAT CTT CAG TGT CTA GAA 
250 

70 

Glu Glu Leu Lys Pro Leu Glu Glu Val Leu Asn Leu Ala Gin Ser Lys 
GAA GAA CTC AAA -CCT CTG GAG GAA GTG CTA AAT TTA GCT CAA AGC AAA 
350 

80 90 
Asn Phe His Leu Arg Pro Arg Asp Leu He Ser Asn He Asn Val He 
AAC TTT CAC TTA AGA CCC AGG GAC TTA ATC AGC AAT ATC AAC GTA ATA 

350 

100 

Val Leu Glu Leu Lys Gly Ser Glu Thr Thr Phe Met Cys Glu Tyr Ala 
GTT CTG GAA CTA AAG GGA TCT GAA ACA ACA TTC ATG TGT GAA TAT GCT 

400 

110 120 
Asp Glu Thr Ala Thr lie Val Glu Phe Leu Asn Arg . Trp He Thr Phe 
GAT GAG ACA GCA ACC ATT GTA GAA TTT CTG AAC AGA TGG ATT ACC TTT 

450 

130 133 
Cys Gin Ser He He Ser Thr Leu Thr 

TGT CAA AGC ATC ATC TCA ACA CTG ACT TGA TAA TTAAGTGCTTCCCACTTAAAA 

500 

<IATAT<ZAGGCCTTCTATTTATTTAAATATTTAAATTTTATATTTATTGTTGAATGTATGGTTO 
GCTACCTATTGTAACTATTATTCTTAATCTTAAAACTATAAATATGGATCTTTTATGATTCTT 
TTTGTAAGCCCTAGGGGCTC!TAAAATGGTTTCACTTATTTATCCCAAAATATTTATT3\frTATG 
TTGAATGTTAAATATAGTATCTATGTAGATTGGTTAGTAAAACTATTTAITAAATTTGATAAA 
TATAAAAA 
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FIG. 2 

9 24 39 54 

GATCCAAAC ATC AGC OGC CTC COC GTC CTC CTC CTC CTC CAA CTC CTC GTC CGC 
MET Ser Arg Lai Pro Val Leu Leu Lai Leu Gin Lai Leu Val Arg 

(1) 

69 84 [C] 99 

COC GGA CTC GAA GGP COC ATC ACC CAG ACA ACS TCC TES AAG ACA AGC TCG CTT 
Pro Gly Lai Gin Ala Pro MET Bar Gin Thr Thr Ser Lai Lys Thr Ser Trp Val 

129 144 159 

AAC TOC TCT AAC ATS ATC GAT GAA ATT ATA ACA CAC TEA AAG CAG CCA OCT TEG 
Asn Cys Ser Asn MET lie Asp Glu lie lie Tor His Leu Lys Gin Pro Pro Lai 

50 

174 189 204 

CCT TIG CTC GAC TTC AAC AAC CTC AAT GGG GAA GAC GAA GAC ATT CTC AUG GAA 
Pro Lai leu Asp Hie Asn Asn Lai Asn Gly Glu Asp Gin Asp He Lai MET Glu 

219 234 249 264 

AAT AAC CUT CGA AGS CCA AAC CTC GAG GGA TTC AAC AGS GCT GTC AAG ACT TEA 

Asn Asn Lai Arg Arg Pro Asn Lai Glu Ala Hie Asn Arg Ala Val Lys Ser Leu 

279 294 309 324 

CAG AAC GCA TCA GCA ATT GAG AGC ATT GET AAA AST CTC CDS CCA TOT CTC COC 
Gin Asn Ala Ser Ala lie Glu Ser lie Lai lys Asn Lai Lai Pro Cys Lai Pro 

100 

339 354 369 

CTC GCC AOS GOC GCA COC AOS CGA CAT OCA ATC CAT ATC AAG GAC GCT GAC TOG 
Lai Ala Thr Ala Ala Pro Tor Arg His Pro lie His lie Lys Asp Gly Asp Trp 
MET 

384 399 429 

AM? QVA TTC OQG AGG AAA CTC AOS TTC TAT CTC AAA AOC CTT GAG AAT GOG CAG 
Asn Glu Ffce Afcgr Arg lys Leu Bir Vha Tyr Leu Lys Hxr Leu Glu Asn Ala Gin 

130 

444 459 475 485 495 

GCT GAA GAG AOS ACT TIG AGC CTC GCG ATC TIT T-AGTOCAAOG T0CAGCT0GT TCTCEGGGCC 
Ala Glu Gin Thr Thr Leu Ser Leu Ala lia Hie - 

147 

™™™ 523 525 535 545 555 565 

TTCTCAOCAC AGOGOCTOQG GACATCAAAA ACAGCAGAAC T1CTGAAAOC TCTGGCTCAT CTCTCACACA 

575 585 595 605 615 625 635 

TPOCAGGAOC AGRAGGAJTT CAUOITiaOC TCOGGCATGA GATCAATTCT TAAITAICEA ATTTCIGAAA 

645 655 665 

TSK3CAGCTC COKTPIGGOC TDGTCOGGIT GTC7ITCTCA 
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FIG. 3 



10 20 30 40 50 

GAATTCCGGG AACGAAAGAG AAGCTCTATC TCCOCTCCAG GAGCOCAGCT MG AAC TCC TTC 

MET Asn Ser Phe 

65 80 95 110 

TOC ACA AGC GCC TTC GGT OCA GTT GCC TTC TCC CTG GGG CTG CTC CTG GIG TTG 
Ser Thr Ser Ala Hie Gly Pro Val Ala Ehe Ser Leu Gly Leu Leu Leu Val Leu 

125 (1) 140 155 170 

OCT GCT GCC TTC OCT GCC OCA GTA OCC CCA GGA GAA GAT TCC AAA GAT GTA GCC 
Pro Ala Ala Hie Pro Ala Pro Val Pro Pro Gly Glu Asp Ser Lys Asp Val Ala 

185 200 215 

GCC OCA CAC AGA CAG OCA CTC AOC TCP TCA GAA OGA ATT GAC AAA CAA ATT COG 
Ala Pro His Arg Gin Pro Leu Thr Ser Ser Glu Arg lie Asp Lys Gin lie Arg 

230 245 260 275 

TAG ATC CTC GAC QGC ATC TCA GCC CTG AGA AAG GAG ACA TGT AAC AAG ACT AAC 
Tyr lie Leu Asp Gly lie Ser Ala Leu Arg Lys Glu Thr Cys Asn Lys Ser Asn 

290 305 * 320 

AUG TGT GAA AGC AGC AAA GAG GGA CTG GGA GAA AAC AAC CTG AAC CTT OCA AAG 
MET Cys Glu Ser Ser Lys Glu Ala Leu Ala Glu Asn Asn Leu Asn Leu Pro Lys 

335 350 365 380 

ATC GCT GAA AAA GAT GGA TCC TTC CAA TCT GGA TTC AAT GAG GAG ACT TCC CTC 

MET Ala Glu Lys Asp Gly Cys Phe Gin Ser Gly Ehe Asn Glu Glu Thr Cys Leu 

395 410 425 440 

CTG AAA ATC ATC ACT GGT CTT TTC GAG TTT GAG GTA TAG CTA GAG TAG CTC CAG 
Val Lys lie lie Thr Gly Leu Leu Glu Phe Glu Val Tyr Leu Glu Tyr Leu Gin 

455 470 485 

AAC AGA TTT GAG ACT ACT GAG GAA CAA GCC AGA GCT CTG CAG ATC ACT ACA AAA 
Asn Arg Phe Glu Ser Ser Glu Glu Gin Ala Arg Ala Val Gin MET Ser Thr Lys 

500 515 530 545 

CTC CTC ATC CAG TTC CTC CAG AAA AAG GGA AAG AAT CTA GAT GGA ATA AOC ACC 
Val leu lie Gin Phe Leu Gin Lys Lys Ala Lys Asn Leu Asp Ala lie Thr Thr 

560 575 590 

OCT GAC CCA AOC ACA AAT GCC AGC CTC CTC AOS AAG CTC CAG GCA CAG AAC CAG 
Pro Asp Pro Thr Thr Asn Ala Ser Leu Leu Thr Lys Leu Gin Ala Gin Asn Gin 

605 620 635 650 

TGG CTC CAG GAC ATC ACA ACT CAT CTC ATT CTC CGC AGC TTT AAGTgAG TTC CTG 

Trp Leu Gin Asp MET Thr Thr His Leu He Leu Arg Ser Phe Lys Glu Phe Leu 
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FIG. 3 (oantinued) 

665 680 696 706 71« 

TRGCATGGGC AJXTCftGAIT GTIGITGTTA 

726 736 746 756 766 77S 

™„»™ 6 8 06 81 fi 826 836 846 8Sfi 

946 956 966 976 986 996 



M ™ GGMAGIGGC TftTCCftGm GAftTRXOdT TOECICAGfiG OCS^GflTCaTT TCTIGGRaftG 

1076 1086 1096 1106 1TT6 no* 

AftlGT&EftftA. TQSEriTEftT ACCAAIAAftT GGOffiETIftA AAAATTCA&& AAA&AAAAftA. AAAAAAAGAA 



TIC 
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FIG. 4 



1 10 
ACC CCC CTG GGC CCT GCC AGC TCC CTG CCC CAG AGC TTC CTG CTC 
Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu 

20 30 
AAG TGC TTA GAG CAA GTG AGG AAG ATC CAG GGC GAT GGC GCA GCG 
Lys Cys LeU Glu Glri Val Arg Lys lie Gin Gly Asp Gly Ala Ala 

40 

CTC CAG GAG AAG CTG TGT GCC ACC TAC AAG CTG TGC CAC CCC GAG 
Leu Gin Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu 

50 60 
GAG CTG GTG CTG CTC GGA CAC TCT CTG GGC ATC CCC TGG GCT CCC 
Glu Leu Val Leu Leu Gly His Ser Leu Gly lie Pro Trp Ala Pro 

70 

CTG AGC AGC TGC CCC AGC CAG GCC CTG CAG CTG GCA GGC TGC TTG 
Leu Ser Ser Cys Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu 

80 90 
AGC CAA CTC CAT AGC GGC CTT TTC CTC TAC CAG GGG CTC CTG CAG 
Ser Gin Leu His Ser Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin 

100 

GCC CTG GAA GGG ATC TCC CCC GAG TTG GGT CCC ACC TTG GAC ACA 
Ala Leu Glu Gly lie Ser Pro Glu Leu Gly Pro Thr Leu Asp Thr 

110 120 
CTG CAG CTG GAC GTC GCC GAC TTT GCC ACC ACC ATC TGG CAG CAG 
Leu Gin Leu Asp Val Ala Asp Phe Ala Thr Thr lie Trp Gin Gin 

130 

ATG GAA GAA CTG GGA ATG GCC CCT GCC CTG CAG CCC ACC CAG GGT 
Met Glu Glu Leu Gly Met Ala Pro Ala Leu Gin Pro Thr Gin Gly 

140 150 
GCC ATG CCG GCC TTC GCC TCT GCT TTC CAG CGC CGG GCA GGA GGG 
Ala Met Pro Ala Phe Ala Ser Ala Phe Gin Arg Arg Ala Gly Gly 

160 

GTC CTG GTT GCC TCC CAT CTG CAG AGC TTC CTG GAG GTG TCG TAC 
Val Leu Val Ala Ser His Leu Gin Ser Phe Leu Glu Val Ser Tyr 

170 174 
CGC GTT CTA CGC CAC CTT GCC CAG CCC T 
Arg Val Leu Arg His Leu Ala Gin Pro 
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